Selective advantage for multicellular replicative strategies: A two-cell example 
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This paper develops a quasispecies model where cells can adopt a two-cell survival strategy. Within 
this strategy, pairs of cells join together, at which point one of the cells sacrifices its own replicative 
ability for the sake of the other cell. We develop a simplified model for the evolutionary dynamics of 
this process, allowing us to solve for the steady-state using standard approaches from quasispecies 
theory. We find that our model exhibits two distinct regimes of behavior: At low concentrations 
of limiting resource, the two-cell strategy outcompetes the single-cell survival strategy, while at 
high concentrations of limiting resource, the single-cell survival strategy dominates. The single-cell 
survival strategy becomes disadvantageous at low concentrations of limiting resource because the 
energetic costs of maintaining reproductive and metabolic pathways approach, and may even exceed, 
the rate of energy production, leaving little excess energy for the purposes of replicating a new cell. 
However, if the rate of energy production exceeds the energetic costs of maintaining metabolic 
pathways, then the excess energy, if shared among several cells, can pay for the reproductive costs 
of a single cell, leaving energy to replicate a new cell. Associated with the two solution regimes of 
our model is a localization to delocalization transition over the portion of the genome coding for the 
multicell strategy, analogous to the error catastrophe in standard quasispecies models. The existence 
of such a transition indicates that multicellularity can emerge because natural selection does not act 
on specific cells, but rather on replicative strategies. Within this framework, individual cells become 
the means by which replicative strategies are propagated. Such a framework is therefore consistent 
with the concept that natural selection does not act on individuals, but rather on populations. 
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One of the most interesting questions under investi- 
gation in evolutionary biology is the emergence of coop- 
eration and multicellularity in biological systems 0, Q 
(and references therein) . While the emergence of certain 
types of cooperative behavior, such as division of labor, 
are reasonably well understood phenomena, the evolution 
of multicellular organisms is a more difficult question. 

With division of labor, a group of cells can more effi- 
ciently metabolize environmental resources than if they 
worked alone, and so it is in each cell's replicative interest 
to cooperate with other cells. In the case of multicellu- 
lar organisms, however, certain cells forgo their ability to 
replicate, so that other cells in the organism can survive 
and reproduce. This is clearly against the replicative in- 
terests of the non-replicating cells, a situation that makes 
the strategy prone to defections. Indeed, defection from 
a multicellular survival strategy, otherwise known as can- 
cer, is a common phenomenon in multicellular organisms. 

Nevertheless, in certain environments, there must exist 
selective pressures driving the emergence of multicellular 
organisms. Perhaps one of the clearest demonstrations of 
such selective pressures is the existence of the organism 
Dictyostelium discoideum, commonly known as a cellular 
slime mold. The slime mold has been the focus of consid- 
erable research (it is an NIH model organism), because it 
lives at the border between unicellular and multicellular 
life: When conditions are favorable, the slime mold ex- 
ists as a collection of free-living, single-celled organisms. 
However, when the slime mold cells are stressed, say by 
depletion of some necessary resource, they respond by 
coalescing into a differentiated, multicellular organism. 



When conditions improve, the slime mold reproduces by 
sporulation (3j|. 

One of the interesting features of the slime mold is 
that, during the differentiation process, some cells in- 
evitably forgo replication for the sake of the multicellular 
structure 3]. In this Letter, we attempt to elucidate the 
selective pressures driving this behavior by considering 
a highly simplified model motivated by the slime mold 
life cycle, one which we believe illustrates the underlying 
principles involved in the emergence of multicellularity. 
We emphasize, however, that this Letter does not con- 
sider the evolutionary dynamics modeling how such be- 
havior could have emerged in the first place. We should 
also emphasize that, although our model is motivated by 
the slime mold life cycle, it is believed that the ability to 
engage in multicellular behavior is ubiquitous amongst 
single-celled organisms, and may even characterize the 
organization of bacterial biofilms . 

For our model, we consider a population of organisms 
whose genomes consist of three distinct genes (or more 
appropriately, genome regions): (1) A reproduction re- 
gion, denoted an, coding for all the various cellular ma- 
chinery involved in the growth and reproduction of the 
organism. (2) A metabolism region, denoting ctmj coding 
for all the various cellular machinery involved in procur- 
ing food from the environment, and metabolizing it to 
release the energy required for various cellular processes 
(as in the metabolism of glucose and the storage of the 
energy into ATP). (3) A multicellular region, coding for 
the machinery necessary to implement the two-cell sur- 
vival strategy. Among the various machinery required to 
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implement the two-cell survival strategy is a switch that 
causes one of the cells to shut off its reproductive path- 
ways, and to devote itself to metabolizing food from the 
environment for the sake of the other cell. This part of 
the genome is denoted by as- 

The full cellular genome is denoted by cr = (jR(JMcrs- 
We assume that there exist master sequences, cr_R.o, cta/.o, 
and (7s,0) corresponding to gene sequences coding for the 
appropriate enzymes necessary for the proper function- 
ing of the various systems. In this single- fitness-peak 
approximation, any mutation to these master sequences 
leads to the loss in function of the corresponding system. 
A cell for which a = (TB„o(7MflUs.a replicates via a two- 
cell strategy, whereby it seeks out and joins with another 
cell with an identical genome. The pathways encoded 
within as,o cause one of the cells to shut off its repro- 
ductive pathways, and to devote its metabolic efforts to 
sustaining the other cell (a possible algorithm that the 
switch could implement is to instruct a cell to shut off 
its reproductive pathways if the reproductive pathways 
of the other cell is on, and to turn on its reproductive 
pathways if the reproductive pathways of the other cell 
is off. The only two stable solutions to this algorithm 
are where one of the cells has its reproductive pathways 
on, while the other cell has its reproductive pathways 
off. Presumably, although the two cells join with both of 
their reproductive pathways on, random fluctuations will 
break the symmetry and lead to collapse into an equilib- 
rium state). 

A cell for which a = <tr,o<7m,oo's, <^s <^s,o, replicates 
independently of the other cells. It is assumed that all 
other genotypes, with faulty copies of either reproductive 
and metabolic pathways, do not replicate at all. 

The cells metabolize a single external resource, which 
provides both the energy and the raw materials for all 
the cells' needs. If we let the basic unit of energy be 
the amount of energy released by metabolism of a set 
quantity of resource, then up to a conversion factor it is 
possible to measure all energy and accumulation changes 
in terms of the resource itself. Of course, because only 
that quantity of resource that has been metabolized has 
provided the cell with energy and raw materials, our basic 
measurement unit becomes the quantity of metabolized 
resource. 

It is assumed that resource is metabolized by each cell 

via a two-step process: (1) A binding step, whereby the 
resource binds to certain receptors, which then pass on 
the resource for metabolism. (2) A metabolism step, 
whereby the resource bound the receptors is then metab- 
olized. Assuming each of the steps is an elementary reac- 
tion, we obtain a metabolism rate r(c) of the Michaelis- 
Mcnten form ac/{l + (3c), where c denotes the concentra- 
tion of resource in the environment. Note that this form 
of the metabolism rate has the property that it reaches a 
maximal value as the concentration of external resource 
becomes infinite. This makes sense, since a cell cannot 



metabolize an external resource at arbitrarily high rates. 
It should be noted, however, that our expression for r(c) 
is not the only one that exhibits this saturation property, 
but it is one of the simplest expressions possible. 

In order to replicate a cell, the various cellular systems 
must be replicated. Each system has an associated build 
cost (measured in units of metabolized resource). Thus, 
if PR, pm, and ps denote the build costs of the reproduc- 
tive, metabolic, and two-cell pathways, respectively, then 
the total cost required to build a new cell replicating via 
the single-cell strategy is given by pr + pm, while the 
total cost required to build a new cell replicating via the 
two-cell strategy is given hy pR + pM + Ps- 

In addition to the build costs for the various systems, 
each system has an associated fixed cost, corresponding 
to the energy and resources required to maintain sys- 
tem function. These fixed costs arise because the various 
components of the cellular systems have intrinsic decay 
rates (protein degredation, auto- hydrolysis of mRNAs, 
etc.), and in the case of switches that have to respond to 
changes in the external environment or the internal states 
of the cell, there is a minimal rate of energy consumption 
associated with measuring ambient conditions. 

There is also an operating cost associated with each 
subsystem, corresponding to energy and resource costs 
associated with carrying out a given system task. For 
example, the replication machinery consumes energy in 
order to process a certain amount of metabolized resource 
toward the construction of a new cell. The metabolic 
pathways require energy to break-down the external re- 
source (in chemistry, such costs are known as activation 
barriers) . 

Let cjR denote the cost of replication per unit of me- 
tabolized resource incorporated into a new cell, and let 
ojm denote the cost of metabolizing one unit of resource. 
Then for a cell replicating via the single-cell strategy, 
the total amount of resource that must be metabolized 
is given by, p == (1 -|- ojm){pr + Pm)- The net rate of 
energy production is given by (1 — WM)'r'{c). Since repli- 
cation and metabolism consume energy at a rate given 
hy p = PR + Pm , the net rate of energy accumulation is 
given by, (1 — u!M)oec/ (1 -|- /3c) — pR — Pm- The replication 
time is therefore given by, 

"■"-^^ = (l-a;MMc)-p 
yielding a first-order growth-rate constant of 

M(c) = ^ = (2) 

For the two-cell replication strategy, the cell that is 
replicating in the cell-pair must accumulate a total of 
(1 -I- ujm){pr + Pm + ps) = p -I- Ap of metabolized re- 
source. The net rate of energy production from both 
cells is given by 2(1 — ujm)t{c). Since only one of the 
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Single-cell replication strategy 



Two-cell replication strategy 




FIG. 1: Comparison of the single-cell and the two-cell repli- 
cation strategies. 



cells in the replicating cell-pair has active reproductive 
pathways, the total energy consumption rate is given by 
2(p-Ap) = pR+2pM+2ps, where Ap = {l/2){pR-2ps). 
Therefore, the replication time is given by. 
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2{l -LOM)r{c)-p + ^p 
yielding a first-order growth-rate constant of 



^ (1 - ujM)r{c) - p + Ap 
p + Ap 



(3) 



(4) 



We should note that we are implicitly assuming in this 
derivation that the amount of time it takes for two cells 
to find each other and combine is negligible compared 
to the replication time. We are also assuming that the 
costs associated with transporting metabolized resource 
from one cell to another is negligible. Finally, we are also 
assuming that the reproductive pathways can process the 
metabolized resource as fast as it is produced. 

We let ni denote the number of organisms with the 
single-cell genome. Because we are neglecting the time 
it takes for two organisms replicating via the two-cell 
strategy to find each other and to combine, we may as- 
sume that all such cells exist in the two-cell state. We 
therefore define n2 to be the number of such cell-pairs 
in the system. Then define the total population of cells 
n = ni + 2n2, and population fractions xi = ni/n and 
X2 — 1 — xi = 2n2/n. 

We also assume that cells may generate mutated 
daughter cells as a result of point-mutations during repli- 
cation. For simplicity, we assume that replication of the 
master sequences ct/j_o and (Tm,o is error-free, so that we 
do not need to consider cells with faulty reproduction 
or metabolic pathways (this situation can be created by 
assuming that the portions of the genomes coding for 
reproduction and metabolism are short, so the probabil- 
ity of mutations occurring in these regions is negligible). 



However, we assume that the per-base replication error 
probability in as is given by e. We let L denote the 
length of as, and define p = Le. We then consider the 
infinite sequence length limit, while holding p constant. 
In this limit, the probability of correctly replicating as 
is given hy ps — e^^. We then have, 

^ = (i^iic) - 'iit))xi + ^K2(c)(l -~ps)x2 (5) 



^ = (^'«2(c)ps - Kit))x2 



dn 
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K(t)n 



(6) 
(7) 



where K{t) = ki{c)xi + 2 ^2 (2)2^2 • 

The above population fractions will evolve to a steady- 
state 0, whose properties we can readily determine: The 
condition that dx2/dt = at steady-state implies that 
either X2 — or R{t = 00) — ^K2{c)ps- If 2^2 = 0, then 
dxi/dt — implies that R(t = 00) = ki(c). 

For a steady-state to be stable to perturbations, we 
must have R{t = 00) > ki(c), \k2{c)ps- Therefore, at 
steady-state we have, R{t ~ 00) — max{iK2(c)p5, ki(c)}. 
Using the formulas for (1/2)k2 (c)p5 and ki(c), and as- 
suming that Ap > 0, we have that 



^^2{c)ps > Ki(c), X2 > 0, if < r(c) < r(c)= (8) 



H2{c)ps < Ki{c), X2 = 0, if r(c) > r{c)-_ 



where, 



r{c)-_ 



1 -ps 



P P+^P 



1 - LUM I - PS 



(9) 



(10) 



p+Ap 



Let zi^i denote the fraction of the population whose 
genome (Tfl,oO'M,oO's is such that DH{as, 0-3,0) — I, where 
I > 0. Then, using similar techniques to those found in 
4], it is possible to show that. 
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^K2(c)a;2^e ''-|-Ki(c)e ^ ^ ] —zi,i_i>-K{t)zi^i 

i'=o 

(11) 



Defining the localization length (l) via, 

00 

then at steady-state. 



1=1 



K{t = 00) 



K{t = 00) — Kl(c) 



(12) 



(13) 



which is finite as long as K{t = 00) = ^K2{c)ps > ki(c), 
and 00 otherwise. 

In other words, once the selective advantage for repli- 
cating via the two-cell survival strategy disappears, the 
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FIG. 2: Illustration of the two solution domains for our 
multi/single-cell replication model. Below r(c) = , the frac- 
tion of cells replicating via the two-cell strategy is a positive 
fraction of the population. Above r(c) = , a localization to de- 
localization transition occurs over as, and the fraction of cells 
replicating via the two-cell strategy drops to 0. 

portion of the genome coding for this strategy undergoes 
a localization to delocalization transition, analogous to 
the error catastrophe (it is also similar to a phenomenon 
known as "survival of the flattest" ) 0, [3 ■ 

Figure 2 shows the various solution regimes as a func- 
tion of r(c) and ps- Note that, for a given value of ps, 
there exists a low-concentration regime where the frac- 
tion of cells adopting the two-cell strategy is positive. In 
this regime, there is a selective advantage for a genome to 
maintain a functional copy of the multicell switch 175,0 ■ 
At a critical concentration given by r(c) = ^(c) = , re- 
sources are sufficiently plentiful that it becomes disad- 
vantageous to instruct a cell to sacrifice its own repro- 
ductive ability for the sake of the other one. The reason 
for this is that, although the average fixed cost per cell 
is lower with the two-cell strategy, the cost of having to 
repHcate the strategy outweighs the savings in fixed costs 
when resources are plentiful. Thus, once r(c) > r{c)=, 
the fraction of cells adopting the two-cell strategy disap- 
pears, and the population consists entirely of cells repli- 
cating via the single-cell strategy. 

If r(c) > r(c)= for ps = 1, then varying ps at this 
concentration will never lead to a selective advantage for 
maintaining a two-cell survival strategy. If r(c) < r(c)= 
for Ps = 1, but r(c) > r{c)= for ps = 0, then for suffi- 
ciently large ps there will exist a finite fraction of the cells 
which replicate via the two-cell strategy. As ps drops be- 
low some critical value, denoted ps,crit, the probability of 
incorrectly duplicating the strategy becomes sufficiently 
large that the fraction of cells replicating via the two-cell 
strategy disappears. This concentration regime is inter- 
esting because it corresponds to a regime where replicat- 
ing via the two-cell strategy is actually the advantageous 



one, but it might not be observed because of replication 
errors. 

Finally, once r(c) drops below r(c)= |ps=o, then ki(c) < 
0, so as long as \k2{c)ps > 0, there will exist a selec- 
tive advantage for maintaining the two-cell strategy in 
the population. Due to mutation, this will also lead to 
the maintenance of the single-cell strategy, although this 
strategy is not self-sustaining in the population. 

If, due to saturation, r(oo) is finite, then one possibility 
is that the parameters of our model are such that r(oo) < 
r(c)= at a given ps- Then for this value of ps, there will 
exist a selective advantage for the two-cell strategy no 
matter what the external concentration of resource (the 
cells cannot metablize the resources sufficiently fast to 
eliminate the selective advantage for multicellularity) . 

The results of our model show that natural selection 
does not act on individual cells, but rather on the sur- 
vival strategy as encoded for in crs.o- Individual cells then 
are more properly viewed as vehicles by which the multi- 
cell strategy is passed on to the next generation. When 
food resources become limited (or when the cells cannot 
rapidly metabolize the food resources present), the effec- 
tive growth rate of the multicell strategy is competitive 
with the total growth rate of the single-cell strategies, re- 
sulting in its preservation in the population. Essentially, 
it becomes advantageous (from the point of view of the 
strategy) for several cells to pool their resources together 
for the purposes of replicating a single cell. When food 
becomes more plentiful, or when the rate of replication 
errors reaches a threshold value, the selective advantage 
for retaining the strategy disappears, and delocalization 
occurs over the corresponding region of the genome. 

A potentially interesting avenue of future research is to 
determine whether there exist natural bounds on the pos- 
sible multicellular replicative strategies, and whether it is 
possible, using thermodynamics and information theory, 
to connect these natural bounds to basic physicochemical 
properties of the constituent reaction networks. 
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